CS224N Final Project: Detecting Key Needs in Crisis

نویسندگان

  • Tulsee Doshi
  • Emma Marriott
چکیده

When a crisis occurs, the world springs into action to try and understand what is happening and what help is required. During these times, twitter has become a key avenue through which to disseminate information. In this paper, we classify tweets into 18 key categories (such as [volunteering] or [injury]) in order to organize the stream of knowledge coming in. We also attempt to predict when a particular category will be tweeted about by leveraging past tweets and categories to predict future tweet categories. Our most successful result, intermixing a Word2Vec model with a standard feed-forward classifier is able to classify the tweets with 59% accuracy. We find that more sophisticated models (including added depth to a feedforward model and an LSTM that uses a sequence of words to predict a tweet) prove to be less effective than a more generic classification methodology, likely because of overfitting on labels that are more common and 2) noise created by combining various types of crises, such as earthquakes and floods, together. We show that our classification model is more successful when restrained to specific types of events, with 70% accuracy on earthquake data, for example. Additionally, we determine that leveraging past history to predict future tweet patterns is moderately successful. We showcase our LSTM methodology which has achieved a 33% accuracy. Lastly, we test out our model on the more recent twitter compilations for two natural disasters: the earthquake in Italy and Louisiana Floods. We show, via qualitative examples and analysis, that our model can likely scale to natural disasters of various forms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CS224n Final Project

I introduce a novel method for disambiguating word senses using a semisupervised approach. I contrast this method with the current state-of-the-art approaches and show that my approach performs well and could potentially lead to fully unsupervised approaches with high accuracy.1

متن کامل

Model of Process Critique of Contemporary Residential Works Based on the components: "Mission, Objectives, Functional Needs and Concepts

The formation of a designer's personality and the acquisition of design skills are dependent on specific components and variables, such as observation and sampling, analysis and handling of samples, direct involvement and impregnation with the design problem. Architectural critique and analysis of the works of the past and successful and unsuccessful examples have always played a key role in ar...

متن کامل

Identify key factors influencing community-based crisis management to reduce vulnerability due to the outbreak of Coronavirus; case Study: Implementation of Shahid Soleimani project in Joybar Sirajmahale village

Background and objectiv:  With the increase in population around the world and the overuse of environmental capacities, natural disasters have increased in recent years, and every year a large number of people pay exorbitant costs (human, financial) due to natural and unnatural crises. Community -based crisis management as a new approach to improve management in advance, during and after the cr...

متن کامل

CS224N/LING284 Final Project: Measuring Functional Load with Word Vectors

Languages are constantly changing – slowly but surely. The shape of a language in the present day is a reflection of its shape in the past and of processes that have affected it over time. Thus, a fundamental component of understanding the system of language is understanding the forces behind these historical processes. Furthermore, given such an understanding and a representation of the state ...

متن کامل

CS224N Final Project: QA

We extend existing question answering (QA) system to deal with words that are unseen or do not have enough examples during training. Instead of learning word embedding from scratch for a specific QA dataset, we decompose the embedding into two components: The first captures the general semantic meaning of a word and can be trained using readily available large corpuses. The second component rew...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017